LTX-2: Efficient Joint Audio-Visual Foundation Model